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STOCHASTIC  REALIZATION  AND  INVARIANT  DIRECTIONS 


Abstract 


Invariant  directions  of  the  Riccati  difference  equation  of  Kalman 
filtering  are  shown  to  occur  in  a large  class  of  prediction  problems  and 
to  be  related  to  a certain  invariant  subspace  of  the  transpose  of  the 
feedback  matrix.  The  discrete  time  stochastic  realization  problem  is 
studied  in  its  deterministic  as  well  as  probabilistic  aspects.  In  par- 
ticular a new  derivation  of  the  classification  of  the  minimal  markovian 
representations  of  the  given  process  z is  presented  which  is  based  on 
a certain  "backward  filter  of  the  innovations.  For  each  markovian  repre- 
sentation which  can  be  determined  from  z the  space  of  invariant  direc- 
tions is  decomposed  into  two  subspaces,  one  on  which  it  is  possible  to 
predict  the  state  process  without  error  forward  in  time  and  one  on  which 
this  can  be  done  backward  in  time. 
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Introduction 

The  aim  of  this  paper  Is  to  extend  the  theory  of  Invariant  direc- 
tions of  the  matrix  Riccati  equation  to  a large  class  of  filtering  prob- 
lems, to  present  some  new  results  on  the  deterministic  and  probabilistic 
aspects  of  the  discrete  time  stochastic  realization  problem  and  to 
Illustrate  the  particular  features  introduced  in  stochastic  realization 
by  the  presence  of  invariant  directions. 

Part  1 of  the  paper  is  concerned  with  characterizing  Invariant  vec- 
tors for  the  usual  linear  least  squares  estimation  problem  in  additive 
white  noise.  We  extend  the  previous  results  on  the  colored  noise  prob- 
lem [8,  14,  29]  to  our  more  general  setting  and  present  some  new  ones. 
The  main  result  of  this  part  is  Theorem  1.6  which  provides  different 
necessary  and  sufficient  conditions  for  invariance.  These  conditions  are 
phrased  in  terms  of  the  convolution  of  two  weighting  patterns,  of  the 
optimal  control  of  the  dual  problem,  of  the  best  one  step  predictor  and 
of  the  feedback  matrix  T(t)  of  the  Kalman  filter.  The  latter  character- 
ization appears  here  for  the  first  time.  Indeed,  the  space  of  all  in- 
variant directions  is  simply  the  invariant  subspace  related  to  the  eigen- 
value zero  of  the  transpose  of  T(t)  for  t larger  than  a certain  value 
This  Interpretation  turns  out  to  be  quite  useful  and  enlightening,  since 

r(0  is  the  transition  matrix  of  the  estimation  error  and  it  is  essen- 
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tial  in  classifying  markovian  representations  in  the  stochastic  realiza- 
tion setting  (see  e.g..  Theorem  2.8).  Also  the  fact  that  invariant  vec- 
tors are  generalized  eigenvectors  sheds  new  light  on  the  proof  techniques 
employed  in  [8,  9,  29].  The  paper  [9]  by  Clements  and  B.  D.  0.  Anderson, 
which  contains  results  closely  related  to  conditions  (ii)  and  (lii)  of 
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Theorem  1.6,  became  available  co  us  right  after  the  first  version  of 
this  paper  was  submitted.  The  emphasis  in  [9],  however,  is  somewhat 
different  from  ours  in  that  the  authors  seek  to  characterize  invariance 
for  a very  general  form  of  the  linear  quadratic  regulator  problem, 
whereas  our  main  interest  lies  in  the  stochastic  implications  of  this 
phenomenon. 

The  second  part  of  the  paper  deals  with  discrete  time  stochastic 
realization  theory.  Given  a wide  sense  stationary  vector  process  z 
with  rational  spectral  density  $,  such  that  $(<»)  is  finite  and 
^(e140)  is  positive  definite  for  all  w,  and  a Hilbert  space  H con- 
taining the  components  of  z(t)  for  all  t,  consider  the  problem  of 
determining  all  minimal  markovian  representations  of  z ( stochastic 
realizations)  driven  by  a white  noise  with  components  in  H.  We  solve 
the  problem  in  the  following  way.  First  the  second  order  properties 
of  the  stochastic  realizations  are  described.  Our  results  integrate 
those  of  B.  D.  0.  Anderson  [3-5],  Faurre  [11,  12]  and  Ruckebusch  [33, 

34].  In  particular,  we  show  that  the  correspondence  in  [33;  p.70]  be- 
tween realizations  with  square  transfer  function  and  real  symmetric  so- 
lutions of  a certain  algebraic  matrix  equation  of  the  Riccatl  type  holds 
without  any  assumption  on  the  feedback  matrix.  Our  analysis  on  this 
aspect  of  the  stochastic  realization  problem  parallels  in  some  respects 
the  continuous  time  work  of  Lindquist  and  Flcci  [19]  . 

Then  we  turn  to  the  probabilistic  side  of  the  problem  which  has 
received  considerable  attention  in  recent  years  [1,  2,  18-23,  27,  32-36]. 
A tool  for  this  study  is  provided  to  us  by  Theorem  2.5,  which  establishes 
a correspondence  between  the  deterministic  as  well  as  stochastic  elements 
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of  realizations  evolving  forward  and  backward  in  time.  The  last  two 
sections  of  Part  2 are  devoted  to  a new  derivation  of  the  classification 
of  the  state  processes  of  stochastic  realizations  due  to  Ruckebusch  [33] 
in  discrete  time  and  Lindquist  and  Picci  in  continuous  time  [19].  Our 
approach  makes  essential  use  of  markovlan  representations  of  the  inno- 
vation process  with  the  estimation  error  as  the  state.  Ruckebusch  has 
used  the  error  process  in  finite  and  infinite  dimensional  stochastic 
realization  to  derive  a number  of  results  [33-35] , but  our  idea  of  asso- 

I 

elating  it  with  a stochastic  realization  of  the  innovations  appears  to 
be  new.  Tackling  the  problem  in  this  way  we  not  only  derive  the  main 
results  in  a rather  simple  manner,  but  we  also  gain  Insight  into  their 
meaning.  For  instance,  the  important  result  that  realizations  which  can 
be  constructed  from  only  the  process  z ( internal ) are  in  one  to  one 
correspondence  with  the  invariant  subspaces  of  the  feedback  matrix  T* 

(Theorem  2.8)  can  be  given  a natural  explanation  in  terms  of  the  back- 
ward filter  of  the  innovations  (see  Remark  2.10).  Last,  but  not  least, 
these  stochastic  realizations  of  the  innovation  process  provide  a key  to 
understanding  the  relationship  between  the  invariant  sub spaces  of  T* 
and  a certain  class  of  inner  functions  in  terms  of  which  it  is  possible 
to  describe  the  realizations  of  z [21,  35,  36].  Our  results  on  this 
subject,  however,  will  be  presented  elsewhere. 

Part  3 is  the  natural  continuation  of  Parts  1 and  2 in  that  it 
explores  how  invariant  directions  affect  the  family  of  stochastic  reali- 
zations. Indeed  the  space  of  invariant  vectors  I is  the  same  for  all 
realizations  and  is  nontrivial  if  and  only  if  $(°°)  is  singular.  The 
characterization  of  I as  the  invariant  subspace  of  the  transpose  of 
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I*  relative  to  zero  is  important  in  establishing  the  two  principal  re- 
sults of  Part  3.  The  first  is  Theorem  3.8  which  says,  loosely  speaking, 
that  in  an  invariant  direction  we  can  either  predict  or  smooth  the  state 
of  an  internal  realization  exactly  (i.e. , without  error),  showing  that 
7 is  closely  related  to  the  germ  space  of  z [23].  The  second  is 
Theorem  3.9  which  embeds  every  internal  realization  in  a chain  of  in- 
ternal realizations  (totally  ordered  with  respect  to  state  covariances) 
whose  minimum  element  has  a full  set  of  predictable  directions  [14]  and 
whose  maximum  one  has  a full  set  of  smoothable  directions  (Definition  3.7). 

The  last  section  of  Part  3 is  devoted  to  comparing  two  possible 
approaches  to  discrete  time  stochastic  realization  based  on  different 
factorizations  of  the  covariance  operator.  We  show  that  the  factoriza- 
tion leading  to  markovlan  representations  without  noise  in  the  output 
[1,  11]  considerably  narrows,  compared  with  the  other  approach,  the 
solution  class  of  the  stochastic  realization  problem  when  $(o»)  is 
singular.  This  deficiency  of  the  first  method  makes  it  advisable  to 
seek  markovlan  representations  of  the  type  considered  in  this  paper  un- 
less nonsingularity  of  $(«)  is  guaranteed. 

It  is  worthwhile  remarking  that  the  assumptions  made  on  the  process 
z in  Parts  2 and  3 are  mostly  for  simplicity.  Indeed  many  of  the  central 
results  can  be  established,  in  a suitably  modified  form,  in  the  nonsta- 
tionary case  under  mild  assumptions  on  z,  albeit  the  derivation  becomes 
more  involved.  This  explains  why  we  refrain  from  introducing  backward 
realizations  and  related  concepts,  like  that  of  smoothable  direction,  in 
the  setting  of  Part  1.  Our  results  on  this  matter  will  be  presented 


somewhere  else. 


Part  1 


INVARIANT  DIRECTIONS  OF  THE  MATRIX  RICCATI  EQUATION 


1.1  Basic  notation  and  formulation  of  the  problem. 

We  use  standard  vector-matrix  notation,  with  the  following  conven- 
tions. The  unit  matrix  Is  denoted  by  I,  the  transpose  of  a matrix  by 
prime.  All  vectors  without  prime  are  column  vectors.  N(R)  Indicates 

the  null  space  of  the  matrix  R.  If  R is  symmetric,  R > 0 (R  2 0) 

1/2 

means  R positive  (nonnegative)  definite.  If  R k 0,  R is  the 
unique  nonnegative  square  root  of  R.  The  Moore-Penrose  pseudoinverse 
[26]  is  denoted  by  #.  The  trace  operator  is  Indicated  by  tr.  The 
cone  of  symmetric,  nonnegative  definite  n * n matrices  is  denoted  by 
Cq.  Kronecker  symbol  is  5gt.  The  superscript  o Identifies  "optimal.'' 
Consider  the  linear  stochastic  model 


(1.1) 


(1.2) 


x(t  + 1)  - Ax(t)  + Bw(t) 


y(t)  - Cx(t)  + Dw(t) 


with  initial  condition  x(0)  * xq,  where  A,  B.  C and  D are  constant 

matrices  of  dimensions  n * n,  n * p,  m x n and  m * p,  xq  is  an 

n-dimensional  zero-mean  random  vector,  the  input  w is  a p-dimensional 

zero-mean  white  noise  sequence  uncorrelated  with  x , E{x  x'}  *■  P and 

o o o o 


E{w(s)w(t) '}  - 16 


1 


2 


a 

As  is  well-known,  the  besC  linear  least-squares  estimate  x(t)  of 
x(t),  given  the  data  (y(0),  ....  y(t  - 1)},  is  generated  recursively 
by  the  Kalman  filter 

(1.3)  £(t  + 1)  - Ax(t)  + K(t)[y(t)  - Cx(t)  ] £(0)  - 0 

where  K(t)  is  given  by 

(1.4)  K(t)  - (AZ(t)C'  + BD')(Cl(t)C'  + DD')# 
and  Z(t)  satisfies  the  Riccati  difference  equation 


(1.5) 


Z(t  + 1)  - AZ(t)A' 

- (AZ(t)C'  + BD')(Cl(t)C'  + DD')#(CZ(t)A'  + DB') 
+ BB  * 

Z(0)  - po 


We  shall  indicate  the  solution  of  (1.5)  at  time  s by  Z(s;  Pq)  when 
we  intend  to  emphasize  the  dependence  on  the  initial  condition  P . 


Definition  1.1  ([8]).  The  n-dlmensional  vector  a is  called  an 

s -invariant  direction  of  equation  (1.5)  if  a'Z(t;  P ) - a'Z(s;  0)  for 

all  t 2 s and  all  P e C . 

o n 


We  shall  study  the  problem  of  characterizing  all  invariant  directions 
of  equation  (1.5). 


1.2  Preliminaries. 

In  this  section  we  transcribe  some  well  known  results  of  duality 
between  estimation  and  control  into  a form  best  suited  to  our  problem. 
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We  refer  the  reader  to  [24]  for  the  variational  principles  underlying 
this  duality. 

Since  x(t  +1)  is  in  the  linear  span  of  y(0) , . ..,  y(t)  there 
exist  matrices  U(s,  t)°  for  s - 0,  ...»  t such  that  x(t  + 1)  - 
”^s-0  ^u^a»  Such  sequence  is  optimal  for  the  following  dual 

problem : find  U(t)  — (U(0,  t),  •••  , U(t,  t))  which  minimizes 


t 

(1.6)  tr{J[U(t)  ]}  - tr{Q(-l,  O'POM,  t)  + £ Z(s,  t) 'Z(s,  t)} 

s-0 

where 


(1.7)  Q(s  - 1,  t)  - A'Q(s,  t)  + C’u(s,  t)  Q(t,  t)  - I 

(1.8)  Z(s,  t)  - B'Q(s,  t)  + D'U(s,  t) 


A standard  argument  yields  the  closed-loop  form  of  the  optimal  con- 
trol 

(1.9)  U(s,  t)°  - -K(s)'Q(s,  t)°  s-0,  ....  t 

Consider  the  linear  estimator  of  x(t  + 1)  given  by  y(t  + 1)  " 

“ls«0  ^s*  ° '7<s>-  Th®0  is  easily  seen  that 

t 

(1.10)  x(t  + 1)  - y(t  + 1)  - Q(-l,  t)'x  + l Z(s,  t)'w(s) 

s-0 

Introducing  the  quantities  P(s,  t)  - E{x(s) [x(t  + 1)  - y(t  + 1)]'}, 

R(s,  t)  * E{y(s)[x(t  +'l)  - y(t  + l)]f}  and  applying  the  operator 
E{*[x(t  + 1)  - y(t  + l)]f)  to  both  sides  of  (1.1)-(1.2)  we  obtain,  in 
view  of  (1.10),  the  following  adjoint  system 

(l.U)  P(s  + 1,  t)  - AP(s,  t)  + BZ(s,  t)  P(0,  t)  - PqQ(-1,  t) 


(1.12) 


R(s,  t)  - CP(s,  t)  + DZ(s,  t) 


The  terminology  Is  justified  by  the  fact  that,  setting  up  the  discrete 
nHn-twnm  principle  for  the  dual  problem  (1.11)  are  seen  to  be,  with  the 
appropriate  normalization,  the  adjoint  equations.  Let  us  note  that 


(1.13) 


R(s,  t)  - 0 


s - 0,  . . . , t 


is  a necessary  and  sufficient  condition  for  optimality  of  the  U(t)  se- 
quence. Whenever  A is  nonsingular  we  can  rewrite  (1.7)  in  the  form 

(1.14)  Q(s,  t)  - (A')_1Q(s  - 1,  t)  - (A')-1C0(s,  t)  Q(t,  t)  - I 


Hence  we  have  the  following  input-output  relations: 

(1.15)  Z(s,  t)  - l T(i)  o (s  - i,  t)  + B'(A')"S"1Q(-1,  t) 

i-0 

s 

(1.16)  R(s,  t)  - l T(i)Z(s  - i,  t)  + CASP  Q(-l,  t) 

i-0  ° 

MS 

where  the  weighting  patterns  T(>)  and  T(«)  are  defined  by 


(1.17) 


(1.18) 


T(i)  - 


T(i)  - 


D'  - B,(Ar)”1C'  i 

-B,(A,)”i""1Cf  i 


i-0 


i-0 


i > 0 


Combining  (1.14)  and  (1.15)  leads  us  to  the  Hamiltonian  system 
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(1.19) 


Q(a,  t)  Ra')"1  0Uq(8  - 1,  t)' 

P(s  + 1,  t)J  [bb^A')"1  a]|p(s,  t) 


-(A')_1C  1 

, D (s,  t); 
BD'  - BB ' (A f)  ■Lg 


fQ(-l.  t)l  fl 


lP(0,  t) 


Q(-l»  t) 


(1.20)  R(s,  t)  * [DB '(A')”1  C] 


Q(s  - 1,  t) 
|P(s,  t) 


+ [DDr  - DB,(A,)“1C,]D(s>  t) 


where  Q(-l,  t)  - (A')*  +1^  (AO^'U  (i,  t).  It  Is  clear  that  the 
weighting  pattern  T^(0  of  the  Hamiltonian  system  Is  just  the  convolu- 


tion of  T(»)  and  T(»). 


(1.21)  T_(i)  - [T  * T](i)  - V T(i  - j)T(j) 

j-0 


The  matrices  1^(0) , . . . , TH(n  - 1)  will  play  a central  role  In 
establishing  necessary  and  sufficient  conditions  for  invariance. 


1.3  Characterization  of  invariant  directions. 

Ve  study  the  case  where  A is  nonsingular.  This  assumption  enables 
us  to  derive  explicit  expressions  for  the  invariant  vectors.  (The  case 
where  no  restriction  is  placed  on  A and  on  the  definltness  of  the  cri- 
terion matrices  has  been  recently  investigated  in  [9]).  The  three  follow- 
ing lemmas  extend  known  results  to  our  more  general  setting. 


Lemma  1.2  The  vector  a is  an  a-invariant  direction  of  (1.5)  if 
and  only  if 


I 

'“I 
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(1.22)  a e W(Q(t  - s,  t)°)  for  all  t z s - 1 and  all  P e C 

o a 

Proof.  Observe  that  a control  U(t)  is  optimal  for  the  dual  problem 
if  and  only  if  it  minimizes  a'J[U(t)]a  for  all  a e Rn.  The  result 
now  follows  from  a straightforward  modification  of  the  argument  of 
Theorem  3 in  [29].  // 

Notice  that  optimal  quantities  in  the  dual  problem  depend  on  the 
terminal  weight  Pq.  To  keep  notations  simple,  we  shall  refrain  from 
explicitly  exhibiting  this  dependence. 

Remark  1.3  The  proof  of  the  sufficiency  part  in  Lemma  1.2  relies  on 
the  fact  that,  under  condition  (1.22),  U(t  - i,  t)°a  is  Invariant  over 
t i s for  i » 0,  ...»  s-1.  Moreover,  when  (1.22)  holds,  it  is  easily 
seen  using  (1. 7)-(1.9)  that  a e N(U(i,  t)°)  n N(Z(i,  t)°)  for 
i ■ 0,  ...,  t - s.  In  particular  it  follows  from  (1.10)  that  a'x(t  + 1)« 
a'  (Z(i,  t)°)  w(i),  where  x(t)  - x(t)  - x(t)  is  the  estimation 

error. 

The  mathematical  framework  set  up  in  the  previous  section  will  be 
useful  in  proving  the  following  result. 


V 


Lemma  1.4  The  vector  a satisfies  (1.22)  if  and  only  if 

(1.23)  a - - l (A')‘VX 
i-1  1 


where  the  m-dimensional  vectors  X, , X.,  ....  X are  such  that 

1 z s 


*ri 

(1.24)  l T_(i)  X,..  - 0 

i-0  H 


j m 1,  ...,  s 
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In  this  case  the  optimal  control  satisfies 


(1.25) 


U(t)  a ■ (0,  •••*  o,  A • •••>  A- ) 


Proof.  Assume  that  (1.22)  holds.  In  view  of  the  time  invariance  dis- 
cussed in  Remark  1.3,  we  can  set  - U(t  - i + 1,  t)°a  for 
i-1,  ....  s.  Expression  (1.23)  can  now  be  derived  using  (1.7)  recur- 
sively. Let  us  consider  the  input-output  relation  of  the  Hamiltonian 


system 


\ 3 

R(s,  t)  - [DBU')'"1  C]A^  p1  Q(-l,  t)  + ^(1)0(8  - 


i,  t) 


where 


Ra')”1  oj 
[bbha')"1  a| 


As  observed  in  Remark  1.3,  a € N(Q(-1,  t)  ) . Then  (1.24)  follows  from  the 
optimality  conditions  (1.13).  Conversely  suppose  a is  as  in  (1.23) 
with  the  X^:s  satisfying  (1.24).  Using  (1.9)  and,  recursively,  (1.7), 
we  obtain 

U(k,  t)°a  - -K(k)  '[(A')6"1"  + £ (A')HC'n(k  + i,  t)°]a 

i-1 

which,  together  with  (1.23)  yields 

U(k,  t)°a  - -K(k)  '{-  l (A')_iC'X 

i-1 


+ l (A')i_1C'[D(k  + i,  t)°a-X  . .]> 

i-1 
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A calculation  similar  to  that  found  in  the  proof  of  Theorem  8 in  [29], 
i.e.,  using  (1.4),  (1.5)  repeatedly  and  condition  (1.24),  shows  that 

s-t+k  . 

(1.26)  *00'  ^ (^C'W-W 

which,  inserted  into  the  previous  expression  for  U(k,  t)°a,  enables  us 
to  derive  U(k,  t)°a  ■ X for  k ■ t - s + 1,  ...,  t recursively. 

This  and  (1.7)  yield  Q(t  - s,  t)°a  - 0,  i.e.,  condition  (1.22).  Also 
(1.25)  now  follows  in  view  of  Remark  1.3.  This  completes  the  proof.  // 


A straightforward  extension  of  the  proof  of  Theorem  8 in  [29]  estab- 
lishes the  following  lemma. 

Lemma  1.5  A vector  a is  s -invariant  for  (1.5)  if  and  only  if  a.  is 
as  in  (1.23)  and 

s 

(1.27)  a'x(t  + 1)  - - l X'y(t  + 1 - i)  for  all  t S s - 1 

i-1  1 

Let  r(t)  denote  the  feedback  matrix  A - K(t)C. 


Theorem  1.6  The  following  statements  are  equivalent: 

(i)  a is  an  s-ircvariant  direction  of  (1.5). 

(ii)  a satisfies  (1.22). 

(Hi)  a is  as  in  (1.23)  and  (1.24)  holds. 

(iv)  a is  as  in  (1.23)  and  (1.27)  holds. 

(v)  a generates  the  same  s -dimens iona l cyclic  subspace  of  T(t)  ' 
for  all  t k s - 1 and  all  Pq  e (V;  this  invariant  subspace 
of  T(t)  ' i3  associated  with  the  eigenvalue  zero,  i.e., 

(T(t)  r)Sa  ■ 0.  Moreover  T(t  - s + 1) ' •••  F(t)  fa  - 0 for 


all  t 2 s - 1. 


I 





Proof.  The  equivalence  of  (i),  (ii),  (iii)  and  (iv)  follows  directly 
from  Lemmas  1.2.  1.4  and  1.5.  Suppose  a satisfies  (v)  and 
observe  that  relations  (1.7)  and  (1.9)  yield  the  expression  Q(t-s,  t)°» 
r(t  - s + 1)  ' •••  r(t)  '.  By  assumption  T(t  - s + 1) ’ •••  T(t)  'a  - 0 
and  (1.22)  follows.  Conversely,  if  we  assume  (iii),  we  derive  from 
(1.26)  and  the  last  part  of  the  proof  of  i-emma  1.4  the  relation 

S;J  -i  •-J-1  -i 

r<e>'Jl  <A'>  c'xi+j  ■ j1  ‘ V 

for  all  t a s - j - 1 and  all  Pq  e Cq,  where  j - 1,  ...,  s - 1 and, 
for  J ■ s - 1,  the  right  hand  side  is  defined  to  be  zero.  This  estab- 
lishes (v) . // 


Condition  (v)  of  this  theorem  is  new.  Its  importance  will  completely 
surface  in  the  stochastic  realization  setting. 

remark  1.7  ([8]).  The  sets  I of  s-invariant  directions  and  I ■ 

S 

of  invariant  directions  are  vector  spaces.  It  follows  from  the 

previous  theorem  that  I - un  . I . 

s-1  s 


Remark  1.8.  The  dimension  of  the  invcariant  subspace  I can  be  easily  de- 
termined in  the  single-output  case  y(t)  * c'x(t)  + d'w(t).  It  is  equal 
to  the  minimum  between  the  rank  of  the  observability  matrix 

[c  A'c  •••  (A')n  ^c] ' and  the  first  index  j such  that  T^(j  - 1)  ■ j 

*•*  ■ T^(0)  - 0 and  T^(j)  + 0.  The  general  case  is  rather  involved. 

We  shall  not  pursue  here  the  extension  of  the  results  of  [29)  on  this 
matter. 

g 

W 


r<Um^  1 1 - — ■' 


" — Y“- 

I j, I . ..  - 
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Let 

QO 

(1.28)  W(z)  - l T(i)z_±  - C(zl  - A)_1B  + D 

1-0 

be  the  transfer  function  of  (1.1)-(1.2)  and 
00 

(1.29)  WH(z)  - ^(Dz"1 

the  transfer  function  of  the  Hamiltonian  system.  The  following  charac- 
terization of  Tg( •)  will  be  helpful  in  the  third  part  of  the  paper. 

Theorem  1.9  Assume  a nonsingular.  Then 

(1.3 0)  WH(z)  - W(z)W(z_1) ' 

If  y in  (1.2)  is  stationary  with  spectral  density  $(z),  we  also  have 

(1.31)  VH(z)  - *(z) 

Proof.  Consider  W(z-1)  • - B'U"1!  - A')~^C ' + D'  - 

-B'(A')  ^(1  - z \a')  "S  ^C'  + D'.  Expand  the  last  term  in  a neighborhood 
of  infinity  as  follows: 

(1.32)  -Br  (Af)-1(I  - z"1(A,)"1)"1Cf  + D' 

- D»  - B' (A')_1C ' - B'(A')‘2C'z_1  - B ,(A')~3C  'z~2  ••• 

00 

- I T(i)z  1 
i-0 

Take  the  Cauchy  product  of  the  two  series  in  (1.28)  and  (1.32)  to  get 
(1. 30) . In  the  case  of  a stationary  y the  well-known  spectral  factor! - 


i 

li 


!■ 


zation  formula 


(1.33) 


$(z)  « W(z)W(z  ) ' 


yields  (1.31).  // 

Notice  that  the  calculations  in  the  previous  theorem  make  sense  be- 
cause the  series  in  (1.28)  and  (1.29)  converge  respectively  to  W(z) 
and  to  Wfl(z)  in  an  appropriate  neighborhood  of  infinity. 

Let  A(t,  s)  ■ E{y(t)y(s) '}  be  the  covariance  operator  of  the  ob- 
servations. It  is  a simple  matter,  using  the  expression  y(s)  ■ 

CA  n x (s  + n)  + T(i) 'w(s  + i)  which  can  be  derived  from  (1.1)- 
(1.2),  to  see  that  the  parameters  1^(0),  •••  , Tfl(n  - 1)  determine  the 
degree  of  "smoothness"  of  A(*,»),  i.e.,  the  number  of  differencing  opera- 
tions on  A(*,*)  necessary  in  each  direction  to  produce  a Kronecker  del-, 
ta.  This  number  has  been  named  in  the  scalar  case  relative  order  of  the 
aavariatvse , see  [14]  for  example.  This  fact  has  its  counterpart  in  the 
spectral  domain  in  Theorem  1.9. 

1.4  Predictable  directions. 

The  invariance  properties  of  invariant  directions  have  been  pointed 
out  by  several  authors  [8,  14].  Indeed,  as  it  is  apparent  from  Theorem 
1.6,  the  space  I is  Invariant  over  models  (1.1)-(1.2)  having  the  same 
covariance  of  the  output  and  the  same  (up  to  a change  of  basis  in  the  state 
space)  pair  (A,  C).  However,  if  a is  an  s-invariant  vector  for  (1.5) 
the  value  a'  £(s;  Pq)  does  depend  on  the  model.  A special  case  of  par- 
ticular interest  is  when  a e N(I(s;  P )). 
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Definition  1.10  ([14]).  The  n-dimensional  vector  a is  called  an 
s -‘predictable  direction  of  equation  (1.5)  if  a'Z(t;  Pq)  - a'Z(s;  Pq)  - 0 
for  all  t 2 s.  The  two  following  theorems  extend  some  results  of  Gevers 
[14]. 


Theorem  1.11  The  vector  a is  cm  s-predictable  direction  of  (1.5)  if 
and  only  if  a is  as  in  (1.23)  with  the  X satisfying 


(1.34) 


l T(i)X  - 0 
1-0  ^+i 


j — 1,  • • • * s 


Proof.  If  a is  s-predictable  a'x(t  + 1)  - 0 for  all  t 2 s - 1. 

Using  (1.10)  with  optimal  quantities  we  see  that  a e W(Q(-1,  t)°)  and 

a e n _ M(Z(i,  t)°)  for  all  t 2 s - 1.  Again  time  invariance  of 
1—0 , . • . , t 

the  optimal  control  can  be  shown  to  hold  and,  identifying  quantities  as 
in  (1.25),  we  get  (1.23)  from  Q(-l,  s - l)°a  - 0.  Also  (1.34)  follows 
from  (1.15).  To  prove  the  converse  first  observe  that  (1.34)  implies 
(1.24).  By  Lemma  1.4  a e M(Q(-1,  t)°)  and  (1.25)  holds.  From  (1.15) 
and  (1.10)  we  conclude  that  a'x(t  + 1)  - 0 for  all  t 2 s - 1,  i.e., 
a is  s-predictable.  // 


Theorem  1.12  Let  Z(s;  P ) > 0.  Then  Z(t;  P ) > 0 for  all  t 2 s 

o o 

if  and  only  if  T(0)  has  rank  m. 

Proof.  Let  X be  such  that  T(0)A  - 0.  Then  (A')  ^C'X  e W(Z(t;  P )) 

o 

for  all  t 2 1.  To  prove  the  other  half  we  use  induction.  Suppose 

Z(t  - 1;  P ) > 0 and  a e W(Z(t;  P )).  It  follows  from  the  principle 
o o 

of  optimality  that 
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(1.35)  0 - a'  E(t;  P )a  - min  {(a'A  + X'C)E(t  - 1;  P )(A'a  + C'X) 

o o 

XeRm 

+ (a  'B  + X 'D)  (B  'a  + D 'X) 

Let  X°-  be  the  optimal  value  in  (1.35).  Since  E(t  - 1;  PQ)  >0  we  get 
a - -(A')_1C'X°t  B'a  + D'X°  - 0 and  finally  (D ' - B '(A')"^ ')A°  - 0. 

If  T(0)  has  rank  m this  implies  that  a ■ 0.  // 

Remark  1.13.  Theorem  1.12  agrees  with  the  results  obtained  by  Silverman 
at  al.  [25,  30,  38].  In  fact,  the  presence  of  nontrivial  predictable  di- 
rections of  (1.5)  implies  that  the  system  (1.1)-(1.2)  is  not  strongly  ob- 
servable [38].  However,  it  can  well  happen  that  it  is  completely  observ- 
able (and  controllable).  In  the  third  part  of  the  paper  we  shall  study  a 
set  of  minimal  realizations  with  a nontrivial  invariant  and,  for  some  of 
them,  predictable  subspace. 

1.5  Discussion. 

Our  study  has.  shown  that  invariant  directions  can  occur  in  a more 
general  situation  than  just  the  noise-free  measurements  case  treated  in 
[8,  14,  29].  Conditions  (iv)  and  (v)  of  Theorem  1.6  provide  us  with  a 
probabilistic  interpretation  of  this  phenomenon.  In  an  invariant  direc- 
tion the  optimal  filter  depends  only  on  some  of  the  last  observation  in- 
stead of  the  whole  information  available.  This  fact  is  strictly  related 
to  the  invariant  subspace  of  T(t)'  corresponding  to  zero.  Moreover,  in 
the  case  when  y is  stationary  with  rational  spectral  density,  condition 
(iii)  of  Theorem  1.6  with  Theorem  1.9  shows  a precise  connection  between 
invariant  vectors  and  Che  spectrum  of  y.  All  of  chis  motivates  the  sto- 
chastic realization  approach  to  the  problem  taken  in  Part  3. 


h 


Finally  we  remark  that  this  theory  can  be  extended  in  a straightfor- 
ward manner  to  the  case  when  the  system  matrices  are  time-varying  replacing 
the  concept  of  invariant  direction  by  that  of  degenerate  direction  [14]. 

A reduction  of  the  order  of  the  Rlccati  equation  which  has  to  be  solved 
can  be  achieved  along  the  lines  of  [8]  whenever  invariant  (or  degenerate) 
directions  exist. 


Part  2 

DISCRETE  TIME  STOCHASTIC  REALIZATION: 

GENERAL  THEORY 

2.1  Notation  and  problem  formulation. 

Almost  sure  equality  between  random  vectors  Is  simply  indicated  as 
equality.  If  {£(t);  t e Z}  '.s  a second  order  vector  process  defined  on 
the  probability  space  (£1,  F,  P)  and  S a subset  of  the  integers  Z, 
we  denote  by  Hs(£)  the  closed  linear  hull  in  F,  P)  of  the  compo- 

nents of  £(t),  t e S.  We  shall  write  H(£),  Ht(£),  H*(£)  and  H(£(t)) 
instead  of  Hz(£),  H{zeZ|zSt}(£) , H{zeZ,z^t}(£)  and  H{t}(£)  respective- 

A 

ly.  Let  E{*|h  (£) } denote  the  orthogonal  projection  operator  onto 
s 

H (£) . We  abbreviate  E{*|H(£(t))}  as  E{*|£(t)}.  The  process  £ is 
s 

called  a wide  sense  vector  Markov  process  if 

E{£(s) |h~(£) } - E{£(s)|£(t)}  for  s 2 t 

or  equivalently 

e{£(s)|h+(£)}  - E{£(s)|£(t)>  for  s s t 

For  the  sake  of  brevity  we  shall  use  the  word  "Markov"  instead  of  the 
expression  "wide  sense  vector  Markov. " 
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We  shall  be  concerned  with  a wide  sense  stationary,  purely  nondeter- 
mlnlstic , m-dimensional  stochastic  process  {z(t);  t £ Z}.  The  process  z, 
defined  on  the  probability  space  (ft,  F,  P),  Is  assumed  to  be  centered  and 
to  have  a rational  spectral  density  $ such  that  $(•)  < 00 • The  finite- 
ness of  $(<»)  is  essential  only  in  Part  3 and  Is  assumed  here  for  simpli- 
city. The  matrix  function  $(•)  enjoys  the  following  properties:  each 
element  of  4 Is  analytic  on  the  unit  circle,  $ Is  discrete  parahermitian, 
i.e. , $(z)  ' ■ #(z  and  $(e*'aJ)  2 0 hermitian  for  all  real  u>.  In 

addition  we  suppose  that  z la  a minimal  ’process  [31]  which.  In  view  of 
the  rationality  of  Its  spectral  density.  Is  equivalent  to  ^(e^)  > 0 for 
all  u>.  This  assumption  too  Is  made  for  convenience  and  can  be  removed 
without  Impairing  the  main  results  of  Parts  2 and  3. 

In  many  problems  of  estimation  and  optimal  control,  when  given  a non 
Markov  process  z which  models  the  Information  flow.  It  is  necessary 
to  resort  to  an  auxiliary  Markov  process  x which  makes  £(t)  “ 
a Markov  process.  More  precisely  we  are  Interested  In  the  following  two 
problems. 

I.  Wide  sense  stochastic  realization  problem:  Determine,  from  the  know- 
ledge of  $,  all  quadruplets  [A,  B,  C,  D] , with  dimension  of  A minimal, 
such  that  the  process  y,  generated  by  the  dynamical  system  (1.1)  - (1.2) 
driven  by  an  arbitrary  normalized  white  noise  w,  has  the  same  spectral 
density  $ as  z. 

II.  Proper  stochastic  realization  problem:  Let  H be  a Hilbert  space 
such  that  H(z)  c H c (ft,  F,  P) . Given  H and  the  process  z find  all 
quintuplets  [A,  B,  C,  D;  w] , with  dimension  of  A minimal  and  w a nor- 
malized white  noise  satisfying  H(w)  c H,  such  that  y(c),  generated  by 


*(t)  } 

z(t  - 1)J 
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(1.1)  - (1.2)  and  z(t)  are  equivalent  random  vectors  for  all  t. 

We  shall  call  a solution  to  Problem  I a wide  sense  minimal  stochas- 
tic realization  and  a solution  to  Problem  II  a proper  minimal 1 stochas- 
tic realization.  It  is  1 cm ed late  that  to  each  proper  stochastic  realiza- 
tion there  corresponds  a (unique)  wide  sense  realization.  The  converse 
is  false.  To  attack  Problem  II  we  shall  choose  a route  passing  through 
the  solution  of  Problem  I,  with  the  intent  of  deriving  some  new  results 
along  the  way.  It  is  good  to  bear  in  mind,  however,  that  a direct  proba- 
bilistic approach  to  proper  stochastic  realization  is  possible  and  in  a 
sense  more  natural  [18,  20-22,  27,  35,  36]. 

2.2  Wide  sense  stochastic  realizations. 

Our  preliminaries  on  Problem  I are  based  on  the  important  work  of 
B.  D.  0.  Anderson  [3-5]  and  Faurre  [11,  12].  Problem  I is  equivalent  to 
the  classical  spectral  factorization  problem.  Find  all  minimal  stable 
spectral  factors  of  $,  i.e.,  all  matrices  W of  real  rational  functions 
of  minimal  McMillan  degree  [6]  and  with  all  their  poles  inside  the  unit 
circle  which  satisfy  (1.33).  Indeed,  if  [A,  B,  C,  D]  solves  Problem  I, 
then  W(z)  ■ C(zl  - A)  + D is  a stable  minimal  spectral  factor  of  ♦. 
Conversely,  any  such  W yields  a whole  class  of  wide  sense  stochastic 
realizations.  In  fact,  using  one  of  the  algorithms  [16,  39,  41]  available 
in  the  literature  we  can  compute  a minimal  [6]  realization  [A,  B,  C,  0] 
of  W.  Then  all  minimal  realizations  of  W given  by 

(2.1)  [T_1AT,  T_1B,  CT,  D]  T € GL^  A(R) 


From  now  on  we  shall  leave  the  word  minimal  out.  All  realizations 
are  to  be  intended  to  be  minimal  unless  the  opposite  is  explicitly  stated. 
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solve  Problem  I.  In  view  of  this  equivalence  Problem  I can  be  solved  as 
follows.  Express  by  means  of  partial  fractions,  as 


(2.2) 


*(z)  - S(z)  + S(z_1) ' 


2 

where  S is  a positive  peal  and  rational  function.  Let  [F,  G,  H,  J] 
be  a minimal  realization  of  S.  As  observed  before,  several  procedures 
are  known  to  determine  [F,  G,  H,  J]  which  is  unique  up  to  an  equivalence 
such  as  in  (2.1).  The  following  simple  lemma  allows  us  to  eliminate  J 
in  the  sequel. 

Lenina  2.1  Let  S be  the  positive  real  function  satisfying  (2.2)  and 
[F,  G,  H,  J]  a minimal  realization  of  S.  If  dim  F - n k 1,  then  F is 
nonsingular  and  J + J'  • G'(F')_1H'  + $(*) . 

Proof.  Taking  limits  in  (2.2)  we  see  that  $(<»)  » J + J'  + 

lim  G'(z  ^1  - F')  ^Hf,  since  S(z)  ■ H(zl  - F)  ^G  + J.  The  conclusion  now 

Z-K» 

follows  from  the  finiteness  of  $(<»)  and  the  minimality  of  [F,  G,  H,  J].// 

To  avoid  trivialities,  we  shall  assume  from  now  on  that  z is  not  a 
white  noise,  i.e. , dim  F m n k 1.  It  follows  from  Lemma  2.1  and  the  cele- 
brated Positive  Real  Lenina  (see  e.g.,  [28])  that  the  set  of  all  wide  sense 
stochastic  realizations  is  nonempty  and  given  by 

(2.3)  [A,  B,  C,  D]  - [T""^FT,  T*1^,  B2)V,  HT,  (R(P)1/2,  0)V] 


where  T e GLfl(R) , V is  any  p x p constant  orthogonal  matrix,  B^  is 
n x m,  B_  is  n x (p  - m)  (here  p 2 m is  arbitrary) , P is  n x n. 


A real  rational  function  with  no  pole  on  the  unit  circle  is  said 
to  be  (discrete)  positive  real  if  it  has  no  poles  outside  the  unit  circle 
and  S(e*u)  + S(e"^)'  £ 0 hermitian  for  all  real  w. 
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symmetric  and  positive  definite,  R(P)  is  the  nonnegative  definite  quan- 
tity G'CFVV  + Hm)  - HPH'  and  (P,  B^  B2)  solve  the  system 

(2.4)  P - FPFf  + B^  + B2B^ 

(2.5)  G - FPH'  + BjR(P)1^ 

It  is  no  restriction  to  choose  T - I and  V - I in  (2.3).  In  fact  all 
other  realizations  can  be  obtained  from  realizations  of  the  form 

(2.6)  x(t  + 1)  ■ Fx(t)  + B^u(t)  + B2v(t)  w - “j 

(2.7)  z(t)  - Hx(t)  + R(P)1/2u(t) 

by  means  of  a change  of  basis  and  an  orthogonal  transformation  of  w.  Hence, 
whenever  convenient,  we  shall  narrow  our  attention  to  realizations  of  the 
type  (2.6)  - (2.7).  We  shall  write  P for  the  set  of  all  symmetric,  posi- 
tive definite  P which  solve  (2.4)  - (2.5)  and  Q.  for  the  subset  of  P 

consisting  of  those  P such  that  R(P)  is  singular.  Notice  that  the 
realizations  corresponding  to  elements  of  Q.  are  precisely  those  which 
have  singular  intensity  of  the  noise  in  the  output  equation.  It  can  be 
shown  [12]  thac  P is  compact,  convex  and  forms  a complete  lattice  when 
endowed  with  the  natural  partial  order  P^  2 P2  if  and  only  if  P^  - P2  i 0. 

There  exist  a maximal  and  a minimal  element  P*  and  PA  so  that  P^  s 
P £ P*  for  all  P e P.  Moreover  the  minimality  of  the  process  z implies 

[13]  that  P*  - PA  and  R(P*)  are  positive  definite.  Hence  P\Q_  is 

nonempty.  The  following  result  provides  us  with  some  information  about 
the  set  Q.. 

I 
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Proposition  2.2  The  set  P\Q  is  convex.  For  all  P e P\Q,  Q e Q. 
and  A € (0,  1]  we  have  that  [AP  + (1  - A)Q]  e P\Q_.  The  set  Q.  is 
contained  in  the  relative  boundary  of  P. 

Proof.  The  first  two  results  follow  at  once  from  the  fact  that  for  P^, 

P2  € P,  A e [0,  1]  we  have  R(AP1  + (1  - X)P2)  - AR^)  + (1  - A)R(P2). 
They  in  turn  imply  that,  if  P e P\Q  and  Q e (£,  the  segment  [P,  Q] 

cannot  be  extended  beyond  Q without  leaving  P.  We  conclude  that  Q 

belongs  to  the  relative  boundary  of  P.  // 

Let  us  Introduce  the  mapping  A:  R n-P>  RnXn  defined  by 

(2.8)  A(P)  - -P  + FPF'  + (G  - FPHWP)"1^'  - HPF ') 

The  set  P\Q,  is  contained  in  the  domain  of  A(*).  It  is  possible  to 
extend  A(»)  to  all  of  P since  the  points  in  £ constitute  removable 
discontinuities.  We  can  now  derive  an  important  alternative  characteriza- 
tion of  the  set  P. 

Theorem  2.3  Let  A(-)  be  given  by  (2.8).  Then  P - {p|p  - P',  A(P)  s 0}. 

Proof.  Let  (P,  B^,  B2)  solve  (2.5)“ (2.6)  with  P * P'  and  P > 0.  Then 

if  P £ P\Qj  we  get  immediately  A(P)  * _®2B2*  ^ P € <£,  let  {P^}^^ 

be  a sequence  in  P\Q.  converging  to  P.  Then  A(P^)  S 0 and  it  follows 

that  A(P)  ■ lim  A(P. ) s 0.  This  shows  that  P e {p|p  ■ P',  A(P)  £ 0}. 
i 1 

The  other  inclusion  can  be  proven  by  an  argument  akin  to  that  used  by 
B.  D.  0.  Anderson  [4;  p.140].  // 

This  result  provides  a bridge  between  the  theory  of  positive  real 

I 

functions  and  the  study  of  quadratic  matrix  inequalities  and  algebraic 
Rlccati  equations. 


f 
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Let  us  Introduce  the  set  P • {P  e P|A(P)  - 0}.  Clearly  P con- 

o o 

slsts  of  all  P e P for  which  ■ 0. 

Remark  2.4  Since  the  eigenvalues  of  F lie  In  the  open  unit  disc,  ele- 
mentary Liapunov  theory  ensures  that  to  each  (B^,  B^)  there  corresponds 
a unique  P.  The  converse  does  not  hold  in  general.  However,  for  reali- 
zations of  the  form  (2.6)-(2.7),  to  each  P there  corresponds  a unique 
B^.  This  is  immediate  from  (2.5)  for  P e P\Q  and  holds  for  all  P e P 

since  points  in  Q.  appear  as  removable  discontinuities  of  the  map 
-1/2 

P *♦"  (G  - FPH')R(P)  • Hence  there  is  a unique  wide  sense  realization 

of  the  type  (2.6)-(2.7)  corresponding  to  each  P in  P . 

o 

Both  Problem  I and  II  seek  to  find  dynamical  systems  evolving  forward 
in  time  like  (1.1)-(1.2)  which  is  natural  to  call  forward  representations 
of  the  process  z.  Tet,  there  are  other  representations  of  interest. 

There  exist  situations,  for  example,  in  which  it  is  more  useful  to  con- 
sider a backward  representation  of  the  form 

(2.9)  x(t  - 1)  - Xx(t)  + Bw(t) 

(2.10)  y(t)  - Cx(.t)  + Dw(t) 

where  w is  a normalized  white  noise  such  that  w(t)  is  orthogonal  to 
H*(x)  for  all  t.  This  leads  us  to  formulate  the  backward  counterpart 
of  Problems  I and  II. 

I.  Wide  sense  backward  stochastic  realization  problem:  Determine,  from 




the  knowledge  of  4,  all  quadruplets  [A,  B,  C,  D] , with  dimension  of  A 

L 

minimal,  such  that  the  process  y,  generated  by  the  dynamical  system 
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(2.9)-(2.1Q)  driven  by  an  arbitrary  normalized  white  noise  w,  has  the 
same  spectral  density  <t>  as  z. 

II.  Proper  backward  stochastic  realisation  problem:  Given  H z 

find  all  quintuplets  [A,  ¥,  C,  D;  w] , with  dimension  of  A and 

w a normalized  white  noise  satisfying  H(w)  c H,  such  that  y(t)  given 
by  (2.9)-(2.10)  and  z(t)  are  equivalent  random  vectors  for  all  t. 

Solutions  to  Problems  I and  II  are  called  wide  sense  proper  back- 
ward stochastic  realisations  respectively.  We  shall  now  briefly  discuss 
Problem  I,  while  Problem  II  will  be  implicitly  solved  in  the  next  three 
sections  in  view  of  Theorem  2.5  below. 

Problem  I is  equivalent  to  the  dual  spectral  factorisation  problem 
considered  by  Anderson  [3]  and  Faurre  [12]  which  consists  in  finding  all 
minimal  unstable  (i.e.,  with  all  the  poles  outside  the  unit  circle)  spec- 
tral factors  W(z)  of  4(z) . It  follows  from  the  parahermitlan  property 
of  that  this  problem  is  equivalent  to  the  spectral  factorization  prob- 
lem for  ♦(•)*•  Hence  all  the  results  on  Problem  I have  a natural  coun- 
terpart in  the  backward  setting  via  the  duality  relation  (F,  G,  H,  $(«))  -*• 
(F',  Hr,  Gr,  ♦(*)')•  In  particular  all  solutions  to  Problem  I are  charac- 
terized by 

(2.11)  [A,  ¥,  C,  D]  - [T_1F  'T,  T*1 (B^ , ¥2)V,  G'T,  (R(P)1/2,  0)V] 

where  T and  V are  as  in  (2.3),  ^ is  n * m,  ¥2  is  n x (p  - m), 

P is  n x symmetric  and  positive  definite,  R(P)  - HF-1G  + 4>(«)  ' - G 'PG 
and  (P,  ¥2,  ¥2)  solve  the  system 
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i 


i 


(2.12)  P - F'PF  + + B2B^ 

(2.13)  H'  - F'PG  + B1R(P)1/2 


Whenever  1c  is  appropriate,  we  shall  restrict  ourselves  to  realiza- 
tions of  the  type 


(2.14)  x(t  - 1)  - F'x(t)  + BjU(t)  + B2v(t) 

(2.15)  z(t)  - G'x(t)  + R(P)1/2u(t) 


where  P is  the  state  covariance.  The  set  P of  all  symmetric,  positive 
definite  solutions  to  (2.12)-(2.13)  and  (£  of  all  P e V such  that  R(P) 
is  singular  enjoy  the  same  kind  of  properties  as  P and  Q.  respectively. 
In  particular  there  exist  PA  and  P*  such  that  S P S P*  for  all 
P € P.  It  is  well  known  [12,  37]  that  P - {P  ^|P  e P},  so  that  * 

(P*)  ^ and  P*  ■ (P^)  \ Indeed,  the  following  result  holds. 


Proposition  2.5  The  quadruplet  [F,  B,  H,  D]  with  B - (B^,  B2)  and 

i/2  _ _ 

D • (R(P)  , 0)  solves  Problem  I if  and  only  if  [F't  3,  Gr,  D]  solves 

Problem  I where 


(2.16)  B - (B^,  B2)  - -P“1F"1B(I  - B'P-1B)1/2 

(2.17)  D - (D  - HF'1B)(I  - B 'P”^B) X^2  - 

- (R(P)1/2  - HF_1Br  - HF_LB2)(I  - B fP~^B) 


Proof.  The  result  follows  from  long  but  simple  calculations  using  (2.4)- 
(2.5)  and  (2. 12)-(2.13) . // 

i 
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This  proposition  exhibits  a correspondence  between  forwasd  and  back- 
ward wide  sense  realizations  and  raises  the  question  whether  a result  of 
the  same  type  can  be  established  for  proper  realizations  . We  turn  to  this 
problem  in  the  beginning  of  the  next  section. 

2.3  Proper  stochastic  realizations. 


Let  us  consider  a proper  stochastic  realization  of  z [F,  B,  H,  D;  w] , 
with  state  process  x and  state  covariance  P.  As  is  well  known,  the  or- 
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and 

(2.20)  w(t)  - (I  - B'P-1B)1/2(w(t)  - B'(F')-V1x(t)) 

We  finally  obtain 

(2.21)  x(t  - 1)  - F'x(t)  - p-h'hd  - B'P_1B)1/2w(t) 

It  Is  not  difficult  to  check  that  v Is  a normalized  vhite  noise  such 
that  v(t)  Is  orthogonal  to  H*(x)  for  all  t.  The  forward  and  back- 
ward noises  are  related  as  follows 

(2.22)  (I  - B'P~1B)1/2w(t)  - w(t)  - E{w(t)|H^+1(x)} 

We  also  have 

z(t)  - Hx(t)  + Dv(t)  - [G  f(F ,)~1p”1  - DB  '(F  ,)~1P~1]x(t)  + Dw(t) 

- G'p"1x(t  + 1)  + [D  - G'p"1B][w(t)  - B'(F')"1P"1x(t)] 

- G'x(t)  + [D  - HF_1B]  [I  - B'P-1B]1/2w(t) 

Sunning  up  we  obtain  a strict  sense  version  of  Proposition  2.5,  analogous 
to  the  continuous  time  result  of  Lindquist  and  Plcci  [19]. 

Theorem  2.5  The  quintuplet  [F,  B,  H,  D;  w]  is  a proper  (forward)  sto- 
chastic realization  of  z with  state  process  x and  state  covariance  P 
if  and  only  if  the  quintuplet  [F',  If,  G',  D;  w]  is  a proper  backward 
stochastic  realization  of  z with  state  process  x given  by  (2.19)  and 
state  covariance  P \ where  w is  as  in  (2.20)  and  If,  D are  given  by 
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Results  closely  related  to  this  theorem  have  been  presented  by 
Akaike  [1;  p.168]  end  Ruckebusch  [33;  p.32].  However,  the  first  deals 
with  realizations  without  noise  In  the  observations,  the  second  does  not 
derive  expressions  for  w,  B and  D such  as  (2.20),  (2.16)  and  (2.17). 

So  far  we  have  said  nothing  about  existence  of  proper  stochastic 
realizations.  It  is  well  known  that  a necessary  and  sufficient  condition 
for  a purely  nondeterminlstic  wide  sense  stationary  process  z to  admit 

I 

finite  dimensional  stochastic  realizations  is  that  its  spectral  density 
is  rational  and  that  in  such  a case  there  exists  a unique  realization  of 
the  type  (2.6)-(2.7)  corresponding  to  P*  (cf.  [33]  for  example).  The 
minimum  variance  realization 

(2.23)  x*(t  + 1)  - Fx*(t)  + B*u*(t) 

(2.24)  z(t)  - Hx*(t)  + R(P*)1/2u*(t) 

is  the  steady -state  Kalman  filter,  with  the  steady-state  Kalman  gain  B* 
given  by 

(2.25)  B*  - (FZH'  + BD')(HlH'  + DD')~1/2  - (6  - FP*H  ')R(P*)-1/2 

where  [F,  B,  H,  D]  is  any  wide  sense  realization  and  Z is  the  unique 
notmegative  definite  solution  to  the  Algebraic  Riccati  Equation 

(2.26)  Z - FZF'  - (FZH'  + BD)  (HlH ' + DD')-1(HIF'  + DB ')  + BB ' 

The  noise  u*  is  called  the  innovation  process  and  is  characterized  by 
the  fact  that  Ht(z)  - Ht(uA)  for  all  t € Z.  Finally,  if  x is  the 
state  process  of  any  proper  realization  (2.6)-(2.7),  we  have 


— — 


(2.27)  x*(t)  - E{x(t)|Ht-1(z)} 

By.  duality  there  exists  a proper  backward  stochastic  realization  corres- 
ponding to  P . , namely  the  backward  steady- state  Kalman  filter 


(2.28) 


(2.29) 


x*(t  - 1)  - F'x*(t)  + B*u*(t) 
*(t)  - G'x*(t)  + R(P*)1/2u*(t) 


Here  the  backward  steady-state  Kalman  gain  B*  Is  given  by 

(2.30)  B*  - (F'lG  + BD')  (G 'IG  + DD  ')"1/2  - (H  ' - F 'P*G) R(P*) “1/2 

where  [Ff,  B,  G D]  is  any  backward  wide  sense  realization  and  7 is 
the  unique  nonnegative  definite  solution  to  the  Dual  Algebraic  Riccati 
Equation 


(2.31)  I - F'lF  - (F'ZG  + BD')  (G 'EG  + DD #)_1(G  'EF  + 55')  + BB ' 

The  equality  H+(z)  “ H*(u^)  for  all  t e Z characterizes  the  backward 
innovation  process  u,. . The  backward  filter  satisfies 


(2.32) 


x*(t)  - E{x(t)|H  (z)} 


where  x is  the  state  of  any  proper  backward  realization  (2.14)-(2.15) . 
By  Theorem  2.5  there  exists  a proper  stochastic  realization  corresponding 
to  (2.28)-(2.29)  (which,  as  it  will  be  apparent  in  the  next  section,  is 
unique) 


(2.33) 


x*(t  + 1)  - Fx*(t)  + B*u#(t) 


(2.34)  z (t)  - Hx*(t)  + R(P*)1/2u*(t) 


i 
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with  state  covariance  P*.  Then,  If  x Is  the  state  process  of  any  real- 
ization, 

(2.35)  I*(t)  - (P*)_1x*(t  + 1)  - P_1E{x(t  + l)|H^+1(z)} 

amA 

-1 


(2.36)  E{x(t)|Ht(z)}  - P(P*)  x*(t) 


-1 


This  justifies  our  choice  of  working  with  P x rather  than  x in  the 
backward  setting.  In  fact  (2.36)  is  not  invariant  over  P. 

Definition  2.6  ([19,  33]).  A proper  stochastic  realization  of  z with 
state  process  x is  said  to  be  internal  If  H(x)  £ H(z) , external  other- 
wise. 

Internal  realizations  are  of  particular  Interest  since  they  are  the 
only  one  we  can  construct  from  the  process  z.  For  example,  the  minimum 
and  maximum  variance  realizations  Introduced  in  this  section  are  internal. 
It  should  be  noted  that  the  existence  of  external  realizations  depends  on 
H.  If  H * H(z) , for  instance,  all  realizations  would  be  internal. 

2.4  Characterization  of  Internal  realizations. 

Let  us  consider  the  spectral  representation  of  z (see  e.g.  [31]) 
given  by 


z(t)  - 


IT 


iut  x 
e dz  (u) 


-IT 


where  d£  is  an  orthogonal  stochastic  measure  such  that 


E{dz(o>)  dz(u>)+}  * *(eiai)  da> 


; — 
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(Here  + denotes  complex  conjugation  and  transposition.)  Let  W(z)  - 
-1  1/2 

H(zl  - F)  ^ + R(P)  be  a square  (m  x m)  spectral  factor  of  4>(z) . 
Then  the  process  u defined  by 

fir 


(2.37) 


u(t)  - 


lwtr.T/  iw.,-1  . 

e [W(e  )]  dz(oi) 


•'-■nr 

Is  a normalized  white  noise  such  that  u(t)  e H(z)  for  all  t [31;  p.41] 

1/2 

and  consequently  [F,  B^,  H,  R(P)  ; u]  is  an  Internal  realization  of 
z.  The  following  result  shows  that  W(*)  being  a square  matrix  function 
is  also  necessary  for  a realization  to  be  Internal. 


Theorem  2.7  ([19,. 33])  A proper  stochastic  realization  is  internal 

if  and  only  if  its  transfer  function  is  square. 


It  follows  from  this  theorem  and  Remark  2.4  that  Internal  realizations 
of  the  form  (2.6)-(2.7)  are  In  one  to  one  correspondence  with  the  real 
symmetric  solutions  of  the  matrix  equation  A(?)  » 0.  Hence,  to  charac- 
terize further  internal  realizations,  one  could  derive  the  discrete  time 
counterpart  of  the  fundamental  results  of  J.  C.  Willems  [40]  on  the  alge- 
braic Rlccati  equation.  However,  a result  akin  to  the  classification  of 
the  solutions  fo  the  algebraic  Rlccati  equation  can  be  obtained  directly 
for  the  state  processes  of  internal  realizations.  Notice  that  once  the 
state  x(t)  of  an  Internal  realization  has  been  determined  the  input 
u(t)  can  be  obtained  inverting  (2.9)  as  follows 

u(t)  - -R(P)"1/2Hx(t)  + R(P)~1/2z(t) 


(In  the  case  when  R(P)  is  singular  we  need  to  perform  an  appropriate 
number  of  differencing  operations  on  the  output  in  various  directions 
(cf.  [7]  for  example)  before  we  can  express  u in  terms  of  x and  z.) 


Therefore  we  turn  to  the  problem  of  characterizing  the  state  pro-* 
cess  of  Internal  realizations. 

Let  us  Introduce  the  feedback  matrix 

r.  - F - V<F„>'1/2H 

The  matrix  T*.  is  asymptotically  stable  due  to  the  minimality  of  z [13]. 
It  plays  a central  role  in  stochastic  realization  theory,  as  it  is  clear 
from  what  follows.  In  particular  we  have  the  following  Important  result, 
whose  continuous  time  counterpart  can  be  found  in  [19]. 

Theorem  2.8  ([33])  The  process  x is  the  state  of  an  internal 

realization  if  and  only  if 

(2.38)  x(t)  - [I  - it  ]x*(t)  + »,**<t) 

where  it  is  the  projection  onto  an  invariant  subspaae  S of  T along 

(P*  - P^)SA.  The  covariance  P of  x and  ir  are  related  as  follows 

s 

(2.39)  TTg  - TT(P)  - (P  - P*)(P*  ~ P*)"1 

We  shall  give  a new  proof  of  this  theorem,  by  means  of  an  approach 

which  allows  us  to  characterize  also  the  external  realizations  in  the 

same  framework.  Our  derivation  hinges  on  the  following  simple  observa- 

1/2 

tion.  Let  [F,  (B^,  B.,)  , H,  R(P)  ; w]  be  a proper  stochastic  realiza- 
tion of  z with  state  process  x and  state  covariance  P.  Then 
[^*»  (®1  “ ®*^(^*)  R(P)  .BjKlUP*)  ^ H,R(PJk)  ^^R(P)^^;w]  is  a proper 


(nonminlmal)  stochastic  realization  of  the  innovation  process  u*  with 
state  process  x - x#  and  state  covariance  P ■ P - P#.  This  can  be  seen 


by  Inverting  the  filter  (2.23)-(2.24)  to  get 


(2.40) 


x*(t  + 1)  - I>*(t)  + B^RCP*)-1'  z(t) 


(2.41)  u*(t)  - -R(P*)'1/2Hx*(t)  + R(P*)-1/2z(t) 


and  using  (2.6)-(2.7).  If  we  set  x(t)  - x(t)  - x*(t),  we  obtain  the 
model 


(2.42)  x(t  + 1)  - T*x(t)  + (Bx  - B*R(PA)-1/2R(P)1/2)u(t)  + B2v(t) 


(2.43)  u*(t)  - R(PA)"1/2fe(t)  + R(P*)"1/2R(P)1/2u(t) 


which  is  a forward  stochastic  realization  of 


u.  since  w(t)  i H (x) 
w c 


for  all  t.  The  representation  (2.42)-(2.43)  is  not  minimal  since  u* 
is  a white  noise  and  its  minimal  realizations  have  dimension  zero.  Con- 
versely consider  a forward  stochastic  realization  of  uA  of  the  form 


(2.44)  C(t  + 1)  - r*£(t)  + BLu(t)  + B2v(t) 


(2.45)  u*(t)  - R(P*)“1/2[H5(t)  + R(P)1/2u(t)] 


where  w is  a normalized  white  noise  and  B^  is  n x m.  Observe  that 

w(t)  is  orthogonal  to  Ht(x),  where  x - E,  + x*,  since  x*(t)  c Ht_1(z) 

- ~ ~ 1/2  ~ 

Ht_^(u#).  We  conclude  fran  this  that  [F,  (B^  + B^R(P)  , B2) , H, 

1/2  « 1/2 

R(P^)  R(P)  ; w]  is  a minimal  stochastic  realization  of  z.  We  col- 
lect these  observations  in  the  following 


. A 
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Lenina  2.9  The  map  which  sends  the  realization  [F,  (b^,  B^),  H, 
R(P)1/2;  w]  to  the  realization  [r*,  (Bx  - B*R(P*)-1/2R(P)1/2,  B2), 
R(P*)  "1/2H,  R(P*)  ■1/2k(W1/2;  w]  is  a one  to  one  correspondence  between 
realizations  of  z of  the  form  (2. 6) -(2. 7)  and  realizations  of 
of  the  form  (2.44)-(2.45) . 


The  map  in  Lemma  2.9  also  induces  a correspondence  between  state  covari- 
ances which  maps  P e P to  P - P*,  translating  the  set  P of  the 
amount  — P^.  The  set  P m P — has  the  zero  element  as  its  minimum 
and  the  positive  definite  quantity  P*  - P^  as  its  maximum.  Notice  that 
the  correspondence  established  in  Lenma  2.9  is  simply  the  correspondence 
between  the  two  input-output  relations 


z(t) 


e^WCe1")  dO(w) 


■'-IT 


and 


u*(t) 


»S“ST1(e1“)W(e1“)  df>(u) 


-IT 


—1  1/2 

where  W(z)  *•  H(zl  - F)  (B^,  B^)  + R(P)  , d©  is  an  orthogonal  stochas- 


tic measure  such  that  w(t)  - / n elh3t  d#(ui)  and  W.(z)  - H(zl  - F)-3^*  + 

-7T  * * 

R(P.)1/2. 


From  (2.23)-(2.24)  and  (2.40)-(2.41)  we  know  that  H (z)  ■ H (u.) 

t t * 

for  all  t and  H(z)  ■ H(u^) . Since  u^  is  a white  noise  we  have  the 
following  orthogonal  decomposition  for  the  space  H(z) 


(2.46)  H(z)  - H~_l(z)  ® H*(u*) 

Then,  if  x is  the  state  process  of  an  internal  realization,  we  have 
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x(t)  - E{x(c) |h(z) } - E{x(t) |H~_^(z) } + E{x(t) |h*(u*)} 
which  implies 

(2.47)  x(t)  - x*(t)  + E{x(t)  - x*(t)|H+(u*)} 

in  view  of  (2.27)  and  the  orthogonality  between  x*(t)  and  H*(u^) . To 
compute  E{x(t)  - x*(t) |Ht(uA) } observe  first  that  x(t)  ■ x(t)  - xA(t) 

Is  the  state  process  of  a realization  of  u*  of  the  form  (2.41)-(2.42) . 
Secondly,  notice  that  uA  is  a stochastic  process  enjoying  all  the  pro- 
perties  of  z.  Therefore  we  simply  derive  relation  (2.36)  with  x and 
uA  In  place  of  x and  z respectively.  Ibis  idea  of  replacing  a sto- 
chastic process  by  its  innovations  is  of  course  very  common  in  filtering 
theory  and  it  turns  out  to  be  helpful  also  in  our  context. 

We  shall  now  derive  the  backward  counterpart  of  a realization  of  the 
type  (2.42)-(2.43)  corresponding  to  an  internal  realization.  We  set  B2  ■ 
0 in  (2.42)-(2.43)  and  define  P ■ P - P^.  An  orthogonal  decomposition 
for  x(t)  as  in  (2.18)  yields  the  Identity 

(2.48)  x(t)  - pr;?*x(t  + i)  + £(t)  - ?r*p*x(t  + i)] 

Observe  that  x(t)  - PI*/P^x(t  + 1)  is  orthogonal  to  H . (z) . Also, 

t— l 

using  (2.42)-(2.43) , we  see  that  E{x(t)  - Pr^x(t  + l)|H^+1(u*)>  - 0. 
Hence,  using  (2.46), 

»(t)  - ?r;p#x(t  + i)  - £{x(t)  - ?r;p#x(t  + i)|u*(t)} 

- E{x(t) |uA(t) } - PH'R(P*)"1/2u*(t) 


and  (2.48)  becomes 
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(2.49)  x(t)  - + 1)  + PH'R(P*)"1/2u<t(t) 

or 

fmtJk***  •I/O 

(2.50)  P x(t)  - rPl\JPx(t  + 1)  + P PH'R(P*)  ' U*(t) 

The  output  simply  reads 

(2.51)  u*(t)  - 0P#£(t  + 1)  + u*(t) 

where  0 is  the  m x n zero  matrix.  The  model  (2.50)-(2.51)  is  the 
backward  counterpart  of  (2.42)-(2.43) . We  stress  the  fact  that  all 
backward  realizations  of  the  innovations  which  we  obtain  in  this  fashion 
from  realizations  (2.42)-(2.43)  with  - 0 have  the  same  input  noise 
u#.  For  x * x*  - we  obtain  the  backward  filter 

(2.52)  (P*  - P*)'1(x*(t)  - x*(t))  - 

- T^(P*  - P*)_1(x*(t  + 1)  - x*(t  + 1)) 

+ H'R(P*)~1/2u*(t) 

A 

Using  alternatively  (2.42)  and  (2.49)  to  compute  E(x(t  + l)x(t)^ 
we  establish  the  identity  T^P  ■ PP^F^P  which  gives 

(2.53)  pr*'  - pr^p 

Then,  using  (2.49)  and  (2.53)  we  obtain 

00 

x(t)  - P l (r^)iH,R(P*)~:L/2u*(t  + i) 
i-0 

which,  together  with  (2.52),  yields  the  desired  expression 

x(t)  - x*(t)  + (P  - P*)(P*  - P*)_1(x*(t)  - x*(t)) 


(2.54) 
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Hence  x(t)  c H(x*(t)  - x*(t))  and  (2.54)  can  be  written 

(2.55)  x(t)  - E{x(t)|x*(t)  - x*(t)} 

- (P  - P*)(P*  - P*)_1(x*(t)  - x*(t)) 


from  which  it  is  seen  that  tt(P)  - (P  - P*)(P*  - P*)~  is  a projection. 
Rewriting  (2.53)  in  the  form 

T*ir(P)  - ir(P)(P*  - P*)P#r*ir(P) 

we  see  that  ir(P)  projects  onto  an  invariant  subspace  of  T*.  Since 
ir(P)(P*  - P^)  * (P*  - P*)tt(P)  ' and  ir(P)  ' projects  along  S'1  [15;  p.61], 
we  conclude  that  ir(P)  projects  parallel  to  (P*  - P^) SX.  Conversely 
if  ir  projects  onto  an  invariant  subspace  of  T*  and  ir(P*  - P^)  ■ 

(P*  - ir  is  an  admiasable  projection  in  Ruckebush's  language, 

it  is  easy  to  construct  first  a realization  of  the  Innovations  and  then 
one  (internal)  of  z along  the  same  lines  as  in  [33].  This  completes  the 
proof  of  Theorem  2.8.  // 

Remark  2.10.  Notice  that,  given  the  special  form  of  the  realization 
(2. 50) -(2. 51),  we  did  not  need  to  Invoke  any  invariance  property  such  as 

A # 

(2.32)  of  the  filter  (2.52)  to  compute  E{x(t) |Ht(u#) }.  The  following 
interpretation  for  Theorem  2.8  emerged  in  the  proof.  The  state  process 
of  an  internal  realization  of  z is  given  by  the  forward  filter  of  z 
plus  a "piece"  of  the  maximum  variance  error  x*(t)  - x*(t). 

This  piece  must  be  such  as  to  conform  with  the  dynamics  of  x*(t)  - x*(t) 
which  is  determined  by  the  transition  matrix  F^,  i.e. , it  must  correspond 
to  an  invariant  subspace  of 
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2.5.  External  realizations. 

It  is  clear  that  a necessary  condition  for  the  existence  of  external 
realizations  is  the  presence  in  H of  elements  orthogonal  to  H(z).  For 
the  sake  of  simplicity  we  assume  that  H * H(z)  ® H(£)»  where  £ is  an 
n-dlmensional  normalized  white  noise  orthogonal  to  H(z) . As  it  will  be 
apparent  from  what  follows,  this  assumption  is  the  minimum  one  needed  to 
guarantee  the  existence  of  a proper  stochastic  realization  corresponding 
to  each  wide  sense  stochastic  realization. 

Let  x be  the  state  process  of  a realization  (2.6)-(2.7)  and  P its 
covariance.  Then  the  counterpart  of  (2.47)  is 

(2.56)  £(t)  - x*(t)  + E&(t)|H+(u*)}  + E{x(t)|H(C)} 
and  (2.48)  corresponds  to 

(2.57)  x(t)  - Pr^P^xU  + 1)  + PH'R(P*)“1/2u*(t) 

+ E{x(t)  - FT^xCt  + 1)  |H(Ol 

Now  let  us  assume  that  £ is  chosen  in  such  a way  that  the  condition 
Ht_^(£)  l H*(x)  holds  and  £ and  x are  stationarily  correlated  for 
every  realization  (2.42)-(2.43) . This  assumption  is  introduced  to  enable 
us  to  treat  £ in  the  same  way  as  the  innovations.  It  will  be  clear 
from  what  follows  that  indeed  this  is  a natural  assumption  when  trying 
to  model  all  realizations  using  a unique  exogenous  noise.  We  can  now 
add  to  (2.42)-(2.43)  the  output. 

£(t)  - Jfe(t)  + [£(t)  - l£(t)] 

where  M ■ E{£(t)x(t)  '}P  and  an  argument  very  similar  to  that  used  for 
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A Jt 

the  innovations  gives  E{x(t)  - Pl^P  x(t  + 1)|h(C)}  • PM'£(t)  so  that 

(2.57)  becomes 

A*  A*  AliM  .1  / O M 

(2.58)  x(t)  - Pr;rx(t  + 1)  + PH'R(P*)  ' u*(t)  + PM'C(t) 

Note  that  M must  satisfy 

P - Pr^r^P  + PH'R(P*)-1HP  + iw'MP 

and  that,  as  in  the  internal  case,  the  input  noise  (uA,  O is  the  same 
for  all  realizations. 

0^0  0^0  ^ Jp 

Let  Xj(t)  and  Xg(t)  denote  E{x(t) |Ht(u*)> 
and  E{x(t)  |H(?)}  respectively.  Then  it  follows  from  (2.58)  that 

(2.59)  xx(t)  - (P  - P*)(P*  - P*)_1(x*(t)  - x*(t)) 
and 

(2.60)  ^(t)  - Pr^x^t  + 1)  + PM'?(t) 

Using  (2.53),  (2.56),  (2.59)  and  (2.60)  we  conclude  that 

(2.61)  x(t)  - x*(t)  + (P  - P*)(P*  - P*)"1(x*(t)  - x*(t)) 

00 

+ l (r;)Ve(t  + i) 

i-0 

Conversely,  given  any  matrix  M such  that  M'M  e C , let  P solve 

n 

P"1  - + H'R(P*)_1H  + M'M 

Then,  using  (2.61),  we  construct  the  state  of  a stochastic  realization 

a# 

of  z.  All  the  realizations  with  singular  P can  be  obtained  through 





limiting  procedures,  using  realizations  corresponding  to  unbounded  se- 


quences of  M *M  in  the  cone  C . 

n 

The  derivation  of  the  classification  of  external  realizations  pre- 
sented above  is  quite  similar  to  the  one  given  in  [33;  p.65],  but  we 
feel  it  will  give  some  further  insight  into  the  concepts  described  there. 
Moreover  it  provides  a clear  stochastic  meaning  for  the  parametric  rep- 
resentation of  the  set  P derived  by  Faurre  [12;  p.52]  in  continuous 

■■ 

time  and  by  Germain  [13;  p.61]  in  discrete  time.  Finally  the  input  pro- 

. 

cesses  of  external  realizations  can  be  characterized  along  the  same  lines 
as  in  [19]. 

1 


Part  3 


DISCRETE  TIME  STOCHASTIC  REALIZATION: 
THE  SINGULAR  CASE 


3.1  Invariant,  predictable  and  smoothable  subspaces. 

Problems  I and  II  are  called  singular  when  $(•)  is  singular.  It 
follows  from  Theorems  1.6  and  1.9  that  in  the  singular  case  there  exist 
nontrivial  invariant  directions  for  the  Riccati  equation  (1.5)  associated 
to  every  solution  to  Problem  I.  Abusing  language  we  shall  say  that  a 
vector  a is  invariant  (predictable)  for  [A,  B,  C,  D]  if  it  is  invari- 
ant (predictable)  for  the  corresponding  equation  (1.5). 


Proposition  3.1  The  space  I of  invariant  directions  is  invariant 
over  ail  wide  sense  realizations  of  z. 

Proof.  Immediate  from  Theorems  1.6  and  1.9.  // 


The  following  result  describes  the  singular  case  in  a number  of 
different  ways. 


Theorem  3.2  The  following  statements  are  equivalent: 
(i)  $(<*>)  is  singular. 

(ii)  T*  is  singular. 


j---  - 


, rtt-  rl  I'jEnT- '■*  J 
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(Hi)  R(P*)  ia  singular. 

(iv)  R(P*)1/2  - B^F')"1!!'  is  singular. 

Proof.  Let  Y e f be  in  the  null  space  of  $(*).  Then,  recalling  that 
♦(•)  ■ DD'  - DB’(F')  ^H'  where  [F,  B,  H,  D]  is  any  wide  sense  realiza- 
tion, we  obtain  from  (2.25)  BiJ(F,)_1H'Y«  (HZH'  + DD')1/2y  - R(P*)1/2y. 

Hence  Y « N(R(P*)1/2  - B*'(F')_1H')  and  (F  f)-1H  'Y  « N(I\J)  • Conversely, 
if  (ii)  holds,  use  the  fact  that  the  eigenvalues  of  are  equal  to  the 

zeroes  of  the  determinant  of  to  get  (iv)  from  which  (i)  follows  tri- 

vially. The  equivalence  between  (ii)  and  (iii)  has  been  proven  by 
Ruckebusch  [33;  p.70].  // 

Corollary  3.3  The  set  Q,  is  nonempty  if  and  only  if  $(«)  is 
singular . 

Proof.  For  any  P e i we  have  R(P*)  £ R(P).  // 

This  says  that  the  singular  case  occurs  precisely  when  some  of  the 
vide  sense  realizations  have  R(P)  singular,  in  particular  when  R(P*) 
is  singular.  This  constrast  with  the  continuous  time  situation  where, 
when  the  innovation  process  is  full  rank,  all  the  input  noises  have  non- 
singular intensity. 

Let  Tg(i)  i ■ 0,  1,  ...  be  as  in  Theorem  1.9  so  that  $(z)  - 

mOP  — i w 

l±mQ  Tg(i)z  for  | z | large  enough  and  T*  be  the  weighting  pattern 
(1.17)  corresponding  to  the  minimum  variance  realization. 

Theorem  3.4  The  following  statements  are  equivalent: 

(i)  a is  an  s-invariant  direction  of  the  wide  sense  realization 
[F,  B,  H,  D]. 

UU  a - l[ml  (F ')_iH ,X±  with  TH(i)A  - 0 j - 1,  ....  s 
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(Hi)  a - (F')"iH,Xi  with  \a~JQ  T*(i)X 


- 0 j - 1,  ...»  s. 


(iv)  a - lS±ml  (F')"1H,Xi  with  a'x*(t)  - X^z(t  - 1)  for 
all  t. 

(v)  a is  a generalized  eigenvector  of  rank  s (an  eigenvector  if 
8-1 ) of  r;  corresponding  to  the  eigenvalue  zero. 

Proof.  The  eigenvalence  of  (ii)  and  (iv)  is  immediate.  The  rest  follows 
at  once  fran  Theorem  1.6,  in  view  of  Proposition  3.1  and  the  fact  that 
the  deterministic  and  stochastic  elements  in  the  minimum  variance  reali- 
zation can  be  obtained  as  limits  of  the  corresponding  quantities  in  a 
transient  Kalman  filter  of  the  form  (1.3).  // 

Corollary  3.5  All  the  invariant  directions  of  [F,  B*,  H,  R(P*)^2] 
are  predictable. 

Proof.  It  follows  directly  from  Theorem  1.11  and  condition  (iii)  of 
Theorem  3 A.  // 

Note  that  in  Theorem  3.4  the  space  I appears  as  the  invariant 
subspace  of  T*  related  to  the  zero  eigenvalue.  We  now  Introduce  the 
backward  counterpart  of  the  concept  of  invariant  direction.  A vector  a 
is  said  to  be  a dually  s-invariant  direction  of  the  dual  transient  Rlccati 
equation 


* 

I(t  - 1)  - F'f(t)F  - (F'Z(t)G  + BD')(G'f(t)G  + DD  f)-1(G 'Z(t)F  + 


(3.1) 


+ BB ' 


17(0)  - P 


if  a'E(-t;  p)  - a'Z(-s;  0)  for  all  t 2 s and  all  P « C . Also  let 

n 


42 


i 


a — 1/9 

T be  given  by  (1.17)  with  [F',  B,  G',  Rtf)*'*]  in  place  of 

[A,  B,  C,  D] . Duality  now  gives  the  following  result. 


Corollary  3.6  The  following  statements  are  equivalent: 

(i)  a is  a dually  s- invariant  direction  of  the  backward  wide 
sense  realization  [F't  B,  G',  DJ. 


(ii) 

a * 

1*1-1 

with 

Co  V1’  ’“j-M.  ■ 0 i ■ 1 ‘ 

(Hi) 

a “ 

IU 

with 

Co  f.«>V  - 0 3 - 1 

(iv) 

a • 

Z*  F‘S. 

with 

a'x<r(t)  - y^z(t  + i)  for  all 

(v)  a is  a generalized  eigenvector  of  rank  s (an  eigenvector 
if  s - 1 ) of  T*  - F'  - B#R(P#)  1/2Gr  corresponding  to  the 
eigenvalue  zero. 

Next  we  define  the  dual  counterpart  of  predictability. 

Definition  3.7.  The  n-dlmensional  vector  a is  called  an  a-smoothable 
direction  of  equation  (3.1)  if 

(3.2)  a'I(-t;  P)  ■ a'E(-s;  P)  • 0 for  all  t £ s 

The  terminology  is  motivated  by  the  fact  that  if  a satisfies  (3.2)  then, 
by  property  (iv)  in  Corollary  3.6,  we  can  smooth  the  state  of  any  proper 
stochastic  realization  corresponding  to  [Ff,  B,  G',  D]  exactly  in  direc- 
tion P ^a.  Clearly  all  the  dually  invariant  directions  of 

_ j_/2  — 

[F ’ , Ba,  G',  R(Pa)  ] are  smoothable.  Let  I indicate  the  space  of 

the  invariant  directions  of  (3.1)  which,  by  Proposition  3.1  and  duality, 

is  invariant  over  all  backward  wide  sense  realization.  Ruckebusch  proved 


i 


i 

I 


■ 


I 
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that  T*  - (P*)’1(P*  - P*)I\J(P*  - P*)”^*  [33;  p.53).  Therefore  it 

follows  from  Corollary  3.6  that  (P*  - P^MP*)”1!"  is  the  invariant  sub- 
space of  corresponding  to  the  zero  eigenvalue.  Moreover  the  dimen- 

sions of  I and  1 are  equal.  The  following  theorem  characterizes  the 
predictable  subspace  of  an  internal  realization  and  the  smoothable  sub- 
space of  the  corresponding  backward  realization.  It  also  shows  that  the 
sum  of  the  dimensions  of  these  two  subspaces  is  constant  and  equal  to 
dim  I. 


Theorem  3.8  Let  x be  the  state  process  of  the  internal  realization 
1/2 

[F,  B^,  H,  R(P)  ; u]  and  S the  invariant  subspace  of  F*  associated 
with  x in  Theorem  2.8,  so  that  x(t)  - x*(t)  + irs(x*(t)  - x*(t))  with 
ir  given  by  (2.39).  Then,  if  a - . (F')  ^'A  belongs  to  S1  n I 

and  a - F iQn1  belongs  to  P*(P*  - P*)-1S  n T we  have 

(3.3)  a'x(t)  - l X;z(t  - i) 

i-1  1 


and 

(3.4)  ItP*)"1x(t)  - l y'*(t  + i - 1) 

i-1 

Moreover  dim(SX  n I)  + dim(P*(P*  - P*)_1S  n T)  - dim  I. 

Proof.  Since  (P*  - P.)  ^ir  (P*  - P ) - ir'  and  ir  projects  parallel 

— s *s  s 

to  (P*  - P*)S1,  we  have  a'ir  -0  and  a'(P*)  - a'(P*)~\  Proper- 

s s 

ties  (iv)  of  Theorem  3.4  and  Corollary  3.6  now  yield  (3.3)  and  (3.4)  res- 
pectively. Let  k be  the  smallest  positive  integer  such  that  I - 

Theorem  3.4(v)  insures  the  existence  of  such  a k.  Then  we 
have  the  direct  decomposition  Rn  - I • ^((r^)^)*  where  R((f^)lt)  is  the 
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range  space  of  (rj)k,  cf.  [15;  p.166]  for  example.  Consider  also  the 
usual  orthogonal  decomposition  R “ ® R((r^)  ),  where  ^((1^)^)  “ 

p*  _ (p*)_1T.  It  follows  that  dim(Snl)  ■ dim(S  n (P*  - PA)  (P*)  *T)  . 

To  complete  the  proof,  observe  that  I * (I  n S)  • (I  n and  that 

dim(S  n (P*  - P*)(P*)_1T)  - dim(P*(P*  - P*)”3^  n T).  // 


It  is  worthwhile  mentioning  that  a ' (P*)”1  in  (3.4)  has  actually 

the  form  with  ££*  T(k)  Vk  " 0 for  3 “ 1 

as  one  can  readily  verify  using  (2.4)-(2.5)  and  (2.16)-(2.17)  to  estab- 
lish the  correspondence  between  T(*)  ' and  !(•)•  Conversely  such  a 
vector  leads  to  a smoothable  direction  in  the  backward  setting.  Hence 
a predict able-smoothable  direction  in  the  forward  setting  (i.e.,  a di- 


rection in  which  the  state  can  be  computed  from  a finite  number  of  obser- 
vations z)  has  the  form  (F')^^  with  Y € NC0,  where  Y'  - 

(Y'  . Y'  ....  Y'»  Y ' t •••»  Y%)  and  T is  a block  diagonal  matrix, 

' 'n— 1 n— 2 o — n —1 

the  two  diagonal  blocks  being  block  triangular  Toeplitz  matrices.  The 
upper  one  has  i^k  row  [T(i  - 1) f,  T(i  — 2) f,  •••»  T(0)  *,  0,  ...»  0]  and 
the  lower  one  has  i^k  row  [T(i  — 1) , T(i  — 2) , ...»  T(0),  0,  ...,  0], 
where  i “ 1,  ...»  n. 


The  1,-tnaar  hull  of  the  components  of  x*(t)  and  x*(t)  is  called 
the  frame  space  [18]  and  denoted  by  H^(z) . In  view  of  Theorem  2.8,  we 
know  chat  the  components  of  the  state  at  time  t of  an  internal  realiza- 
tion belong  to  Art.  Let  us  introduce  the  subspace  H +(z)  of  H^(z) , 
t t 

given  by  the  linear  hull  of  elements  of  the  form  a'x*(t)  and 

a'(P*)-1x*(t),  where  a varies  over  I and  a over  J.  By  analogy  to 

the  continuous  time  case  [10] , we  shall  call  H +(z)  the  germ  space , 

t 
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since  it  contains  linear  combinations  of  differences  of  the  type  b^z(a)  - 
z(s)  - z(s  - r)  and  of  certain  other  values  of  the  process  z that  in- 
dicate precisely  the  degree  of  "smoothness"  of  the  covariance  of  z in 

different  directions.  Then  Theorem  3.8  shows  that  dim(X(t)  n H +(z))  - 

t 

dim  X,  where  X(t)  is  the  space  spanned  by  the  components  of  the  state 
x(t)  of  an  internal  realization.  Note  that  in  contrast  to  the  contin- 
uous time  situation  [18],  the  inclusion  H +(z)  c X(t)  does  not  hold. 

From  now  on  let  dim  I m V. 

Theorem  3.9  Let  [F,  H,  R(P)  ; u]  be  an  internal  realization. 
Then  this  realization  can  be  embedded  in  a chain  of  internal  realizations 

[F,  B1(i),  H,  R(Pi)1/2;  u^  with  state  spaces  Xi(t),  i - 0 

such  that  P SP.S-S  P . (X  (t)  n H .(z))  c H~  . (z)  and 

O L VO  t“  X 

(Xv(t)  n H +(z))  c H+(z). 

Proof.  Let  S be  as  in  Theorem  3.8  and  a^,  . ..,  a^  be  a basis  for 
S'1  n I.  Then  we  can  generate  a family  of  invariant  subspaces  of 

T*.  i - 0,  ....  V,  with  dim(S^  n I)  - v - i,  simply  eliminating  from 
S1,  one  at  a time,  the  or  adding  to  S1  new  linearly  independent 

elements  of  I,  both  operations  being  performed  taking  due  care  of  the 
rank  of  the  generalized  eigenvectors  which  are  dropped  or  added,  so  that 
the  resulting  subspace  is  indeed  invariant  for  T*.  This  can  be  done 
since  I rnn  be  decomposed  into  cyclic  subspaces.  Clearly  this  procedure 
•yields  a family  of  internal  realizations  which  differ  only  on  the  germ 
space  and  such  that  S " The  state  covariances  are  totally  ordered 


since,  if  i < j and  x^t),  x^  (t)  are  the  corresponding  state  pro- 
cesses, x. (t)  is  equal  to  x*(t)  in  any  direction  in  which  it  differs 
1 


Nefarn'  i TV. 
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from  x. (t).  Finally,  by  construction,  [F,  B. (0) , H,  R(P  )1/Z;  u ] has 
j loo 

a full  size  predictable  subspace  and  the  backward  realization  correspond- 

1/2 

ing  to  [F,  B^(V),  H,  R(PV)  ; u^]  has  a full  size  smoothable  subspace. 
Thus,  the  last  assertion  of  the  theorem  follows.  // 


Notice  that  the  chain  of  realizations  in  Theorem  3.9  is  by  no  means 

unique.  However  the  minimum  and  the  maximum  realizations  are  uniquely 

determined.  In  the  case  when  T*  is  cyclic,  the  number  of  internal 

realizations  is  finite  and  £ 2n  [40;  Remark  18] . Our  work  has  shown 
a— v 

that  2 (v  + 1)  is  actually  an  upper  bound  in  the  cyclic  case.  In 
fact  internal  realizations  are  in  one-to-one  correspondence  with  the  in- 
variant subspaces  of  and,  when  F*  is  cyclic,  I is  cyclic  «n A the 

chain  of  invariant  subspaces  constructed  in  Theorem  3.9  is  unique,  so 

that  the  number  of  different  invariant  subspaces  of  is  less  than  or 

n— v 

equal  to  2 (v  + 1). 

Let  us  consider  a proper  external  realization  of  the  form  (2.6)- 

(2.7)  and  an  invariant  direction  a - (F')*^^  for  it  which  is  not 

predictable.  Then  two  cases  can  occur.  Either  (F ')”*H rX. . . be- 

fai«l  i+j 

longs  to  for  j “ 0,  •••»  n — 1 or  it  does  not.  It  can  be  seen 

that  in  the  first  case  we  are  in  a situation  akin  to  the  one  for  internal 
realizations  and  we  can  associate  to  the  vector  a a smoothable  direction  in 
the  backward  setting.  In  the  second  case,  which  always  occurs  if 
^2^2  > a Invariant  but  the  state  cannot  be  determined  exactly 
from  a finite  string  of  observations  and  we  would  need  to  have  available 
the  process  £ orthogonal  to  H(z)  and  to  model  external  realizations 
as  done  in  Section  2.5  to  be  able  to  calculate  the  state  in  v linearly 
independent  directions.  For  the  sake  of  brevity,  we  have  avoided  here 
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going  Into  details  about  external  realizations.  However,  It  should  be 
clear  from  our  discussion  that  the  sum  of  the  dimensions  of  the  predictable 
and  smoothable  subspaces  associated  with  an  external  realization  is  less 
than  or  equal  to  v.  This  fact  has  the  Intuitive  meaning  of  Indeterminacy 
introduced  by  the  presence  of  the  orthogonal  component  £. 

The  presence  of  nontrivial  Invariant  directions  allows,  as  it  should 
be  expected,  for  a reduction  in  the  dimension  of  the  filtering  algorithms 
available  In  the  literature.  For  instance,  it  is  a simple  exercise  to 
verify  that  Faurre's  algorithms  to  compute  P*  and  P*  [12;  p.56]  reduce 
to  solving  (n  - *)  x (n  - v)  matrix  equations,  the  values  of  P*  and 
(P*)  ^ on  the  subspaces  I and  I respectively  being  known  a priori  in 
terms  of  H,  F and  G.  A similar  reduction  can  be  obtained  for  the  fast 
algorithms  which  compute  the  gain  (1.4)  directly  (cf.  [17]  for  example), 
since  it  is  clear  that  in  an  invariant  direction  the  value  of  the  gain 
can  be  computed  directly  in  terms  of  the  system  matrices. 
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3.2  Noise  free  stochastic  realization  and  the  singular  case. 

Akaike,  in  his  important  paper  [1],  deals  with  Markovian  representa- 
tions of  the  process  z without  noise  in  the  output  and  only  in  his 
concluding  remarks  discusses  representations  with  additive  noise  terms. 

Indeed,  his  work  was  based  on  some  results  of  Faurre  [11]  which,  starting 
from  a certain  factorization  of  the  covariance  matrices,  were  phrased  in 
terms  of  noise-free  realizations.  In  subsequent  work  [12]  Faurre  turned 
to  a different  factorization  of  the  covariance  matrices  which  led  natu- 

1 


rally  to  realizations  with  noise  in  the  output.  The  same  choice  has, 
since  then,  been  made  by  a number  of  authors  [13,  22,  23,  33],  but,  up 
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to  our  knowledge,  it  has  never  been  explained  whether  Che  two  approaches 
are  equivalent  and.  If  not,  what  are  the  shortcomings  of  either  one.  We 
shall  now  show  that,  precisely  in  the  singular  case,  the  first  approach 
presents  a considerable  disadvantage,  in  that  many  minimal  Markovian  real- 
izations are  lost.  Let  us  start  considering  a minimal  factorization 
(E,  @,  ¥)  (i.e.,  completely  controllable  and  observable)  like  the  one  in 
[11],  namely 


(3.5)  A - E{z(t  + j)z(t)  '}  - <?Ej0  j - 0,  1,  2,  ... 


and  let  dim  E - r.  On  the  other  hand,  since  $ is  the  double  side 
z-transf orm  of  A,  we  have 


(3.6) 


HF^_1G  j - 1,  2,  3,  ... 
G ' (F  ')  _1H  ' + *(®)  j - 0 


Theorem  3.10  Let  k be  the  dimension  of  N($(®))  and  assume , with- 
out loss  of  generally,  that  $(®)  - [R'O]  lihere  R is  (m  - k)  x m. 
Then  (S,  0,  ¥)  is  given , up  to  a change  of  basis,  by 

(3.7)  (5.  9,  V - (s’-pR6].  [h  (J)]) 


where 


0 

0 


the  identity  matrix  is  m - k dimensional  and  r - n + m - k. 

Proof.  It  is  easy  to  check  that  the  triplet  in  (3.7)  satisfies  (3.5). 


Also  (E,  0)  Is  controllable  and  (E,  f)  is  observable.  In  fact  sup- 
pose a with  a.  e Rn  and  a.  e Rm  k is  such  that 


F-1G  G FG 


(3.8)  (c^,  ap 


pn+o-k-2^ 


R 0 0 


Then  we  see  that  must  be  zero,  which  forces  a^R  “ 0 and  finally 

■ 0.  We  conclude  that  the  controllability  matrix  in  (3.8)  is  full 
rank.  Similarly  the  observability  matrix  is  seen  to  have  rank 
n + m - k.  The  conclusion  now  follows  from  the  uniqueness,  up  to  an 
equivalence  as  in  (2.1),  of  the  triplet  (E,  9,  '?).  // 


Let  us  assume  for  the  moment  that  $(«)  is  nonsingular  and  consider 
a proper  stochastic  realization  of  z [F,  B,  H,  D;  w] . Then  we  can 


associate  to  it  the  noise  free  model 


(3.9)  C(t  + 1)  - F5(t)  + 


(3.10)  z(t)  - [H  I]5(t) 


F-1B 

D - HF_1B 


where 


F_1x(t  + 1) 

(D  - HF_1B)w(t) 


and  n(t)  “ w(t  + 1).  This  induces  a one-to-one  correspondence  between 
wide  sense  realizations  of  the  form  [F,  B,  H,  D]  and  noise  free  wide 
sense  realizations  of  the  form  [F,  x»  (HI)]  which  are  minimal  too  in 
view  of  Theorem  3.10.  If  we  agree  to  call  realizations  [F,  x>  (H  I)  ; h] 
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with  x (n  + ®)  x m Internal,  then  the  above  correspondence  is  one-to- 

one  between  Internal  realizations.  In  particular  It  maps 
1/2 

[F,  B*,  H,  R(P^)  ; «*1  to  a realization  related  to  the  steady  state 

pure  filter , i.e.,  the  second  Innovation  representation  IR^  In  Gevers 
terminology  [14] . 

Suppose  now  that  $(«)  Is  as  in  Theorem  3.10  with  k > 0.  Then 
it  is  possible  to  set  up  a correspondence  similar  to  the  one  in  the 
nonsingular  case  only  for  a rather  small  subclass  of  wide  sense  realiza- 
tions. More  explicitly,  let  [F,  B,  H,  D;  w]  be  a realization  such 
that  T(0)  ” D ' - B'(F')  ^H'  has  rank  m - k and  V an  orthogonal 


—1  s 

matrix  such  that  [D  - HF  B] V ■ ^ 


where  S is  (m  - k)  x p,  p 


being  the  number  of  columns  of  B.  Then  we  have  the  n + m - k dimen- 


sional noise  free  model 


(3.11) 


(3.12) 


£(t  + 1)  - F£(t)  + 


F_1B 

i 

_ S _ 


[■  (;)]' 


where  £(t) 


F_1x(t  + 1) 
SV  'w(t) 


and  n(t)  * V'w(t  + 1).  The  wide  sense 


realization  given  by  (3.11)-(3. 12)  is  minimal . This  establishes  a one- 

to-one  correspondence  between  minimal  wide  sense  realizations  of  z 

such  that  T(0)  has  rank  m - k and  minimal  wide  sense  realizations  of 
~ (’ll 

the  form  [F,  X»  (H  qJ)]*  It  is  now  apparent  that  the  choice  of  seeking 
noise  free  representation  of  z can  cost  us,  in  the  singular  case,  the 
loss  of  a considerable  number  of  realizations.  Indeed,  it  is  not  hard 
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Co  see  that  the  subseC  of  P corresponding  Co  realizations  wlch  rank 
T(0)  ■ m - k lie,  as  Q,  in  the  relative  boundary  of  P. 

This  shows  that,  in  discrete  time,  the  factorization  (3.6)  and  the 
associated  choice  of  H , (z),  instead  of  H (z),  as  past  space  at  time 

t X L 

t,  is  more  convenient,  even  though  it  implies  the  unpleasant  fact  that 
white  noise  processes  have  zero  dimensional  minimal  realizations. 
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